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Abstract 

We provide an improvement over Meshulam's bound on cap sets in . We show 
that there exist universal e > and C > so that any cap set in has size at 
most Cj^TTT- We do this by obtaining quite strong information about the additive 
combinatorial properties of the large spectrum. 







1 Introduction 

A set A C is called a cap set if it contains no lines. In this paper, we will be concerned 
with proving the following theorem: 

Theorem 1.1. There exists an e> 0, and C < oo such that if A C F^ is a cap set, then 

\A\^ C 



3Ar - ^1+,- 

The problem of the maximal size of cap sets is a characteristic 3 model for the problem 
of finding arithmetic progressions of length 3 in rather dense sets of integers. Meshulam 
|M95j , through a direct use of ideas of Roth, was able to show that there is a constant C 
so that any cap set A has density at most Our result may be viewed as a very modest 
improvement over Meshulam's result. 

Sanders |S11] recently showed that any subset of the integers whose density in {1, ... , M} 

is at least '^^^°og°f/^'' must contain arithmetic progressions of length 3. This may be 
thought of as bringing the results known for arithmetic progressions almost to the level of 
Meshulam's result. This has spurred further interest in improving Meshulam's result in 
hopes that it might suggest a way of improving the results on arithmetic progressions. 



^MSC 2010 classification: 11T71,05D40 
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A rather concrete, though perhaps still out of reach, goal in this direction is a conjecture 
of Erdos and Turan: 

Conjecture 1.2. Suppose A QI^ is such that 

E — = DO. 

Then A contains an arithmetic progression of every length. 

It is clear that the present paper is directly relevant only for finding 3-term progressions. 
However it is also easy to see, based purely on density considerations, that proving an 
estimate of the type in Theorem 11.11 in the integer setting would yield the 3-term case of 
the conjecture stated above. In fact. Polymath 6 |PM6j has recently been started with 
the goal of adapting the ideas of this paper to the integer setting. See |PM| for more 
information about so-called "polymath" projects in general. 

While the research in this paper was well underway, Gowers [G] wrote a post on his blog 
suggesting that one could attack the problem of bounding cap sets by studying the additive 
structure of their large spectrum. This had been our approach as well and we wrote a reply 
[Kl] sketching our rather strong results regarding that structure. In the course of a few 
days, we realized that we actually could convert our structural theory into an estimate on 
the size of cap sets. We recorded this [K2] in a second reply to Gowers's blog. The current 
paper should be viewed as an elaboration of these two posts. 

We describe our plan for proving Theorem ll.il To prove this theorem we will prove a theo- 
rem about sets without unusually dense subspaces, a notion we make precise below. 
Definition 1.3. We say a set A has no strong increments if for every subspace V C 
with d = codiml/ < have 

\AnV\ 20dp 
\V\ N ■ 

Theorem 1.4. There exists an e > 0, and C < oc such that if A CI is a cap set with 
no strong increments, then 

\A\ ^ C 

Major ingredients needed to prove this theorem are Proposition 13.31 Lemma 15.31 and 
Theorem 17.11 We combine them with a Fourier analytic argument in Section [HI 

Proof. We deduce Theorem 11.11 from Theorem 11.41 using induction. Suppose that for every 
n < — 1 we have shown that if B Q F^ is a cap set, then 



We aim for a contradiction: assume there exists a cap set A C such that > j^^t- 
By Theorem 11.41 this imphes A has a strong increment. Since A has a strong increment, 
there exists an afhne subspace V Q with codimension < ^ such that 

\AnV\ 20dp 

\v\ - 

C 

> 



since the derivative of -^^f^ is uniformly bounded by IGCN^^^'' = 16pN^^ whenever < 
e < 1. But we know that Any (in fact any subset of A) is a capset. This contradicts the 
induction hypothesis, yielding Theorem ll.il □ 



1.1 Proof Sketch 

We sketch our plan for proving Theorem 11.41 

We will study the large spectrum A of a cap set A with no strong increments. The reader 

nN 

should think of A, for a cap set of the size ^ given by Meshulam's estimate, as consisting 
of the positions at which the absolute value of the Fourier transform of A is around 
The set A should have cardinality approximately A'^^ (established in Section [3]) and have 
about A^^ additive quadruples (established in Section UD . Recall that an additive quadruple 
is a quadruple [xi, X2, X3, X4) of elements of A with the property that 

Xl + X2 = X3 + X^. 

Similarly an additive octuple is an octuple (xi, X2, ^3, X4, X5, xg, X7, xg) of elements of A 
with 

Xi + X2 + X3 + X4 = X5 + Xg + Xy + Xg. 

It is easy to see a priori that a set with many additive quadruples will have many additive 
octuples. In our case, a set with size A^^, having A^^ quadruples must have at least 
octuples. (But it may have more octuples.) The number of octuples it has should be taken 
as an indication of its structure. If there are many octuples, it means that the sumset 
A + A looks like it has more additive structure than the set A. We then say the set A 
is additively smoothing. (It becomes smoother under addition.) We show, however, that 
this cannot be the case for A, the large spectrum. We use the probabilistic method to 
do this, finding too much of the spectrum contained in a small subspace in the additively 
smoothing case. We establish this in Section [5j (This is somewhat reminiscent of the paper 
of Croot and Sisask (CSll] where random selections are used to uncover structure.) 
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Thus our set A is entirely additively non-smoothing. This means it is ah'eady as smooth 
as it will become under a small number of additions. This makes its additive structure 
particularly easy to uncover as it is already present without adding the set to itself. This 
kind of idea was first exploited in a paper of the second author with Koester |KK10j , and 
we use techniques quite similar to those found in that paper. We end up showing that 
the set A should be thought of as looking like the sum of a very structured set K of size 
( that is to say that K is almost additively closed) with a very random set A of size 
N"^. Section [6] contains the proof of a structural theorem for sets with substantial additive 
energy (i.e., many additive quadruples) but no additive smoothing. 

We find that this structure of A is inconsistent with A's being a cap set with no strong 
increments. The reason is that we can use Preiman's theorem to place K inside a subspace 
H with relatively low dimension. We can essentially mod out by , examining the "fibers" , 
the intersections of A with translates of H^. We find that the structure of A makes the 
behavior of the fibers unrealistic. This argument is suggested by a paper of Sanders |S10| . 
and is carried out in Section [HI 

One final remark about the value of e obtained in this paper: it is necessarily rather 
small (at least with our current argument). We discuss the reasons for this in a brief final 
section and give some conjectures that, if true, would greatly improve the efficiency of our 
argument, we have not attempted to optimize e (or even keep track of exact dependence 
on e) throughout the paper. 

Acknowledgements The first author is supported by an NSF postdoctoral fellowship, 
DMS-0902490. The second author is partially supported by NSF grant DMS-1001607. 

The authors also thank Izabella Laba and Olof Sisask for pointing out an error in the 
statement of the asymmetric Balog-Szemeredi-Gowers theorem in a preliminary version of 
this paper. 



2 Preliminaries 

In this section, we record a few general results of which we will make frequent use through- 
out the paper. We begin with our favorite form of the Cauchy-Schwarz inequality. 
Lemma 2.1. Let S and T he finite sets and p a map 

p:S^T. 

Let P be the set of pairs 

P = {(si,S2) : p{si) = P{S2)}- 

Then 

ipi>ffi. 
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Proof. Note that we can express |-P| by 

teT 

Applying the Cauchy-Schwarz inequahty we see 

Here we introduce another variant of Cauchy Schwarz: 

Lemma 2.2. Let {X,m) be a measure space with total measure M. Let Ai, . . . A), he 
measurable subsets of X and < p < 1 be a number (the density), so that m{Aj) > pM 
for each j. Suppose k » K Then 

Proof Note that 

k 

'^m{Aj) = pkM « k^p^M, 

since kp» 1. Thus we may estimate the full sum 

k k 

j=i 1=1 

Define c{x) to be the measurable function giving for each x, the number of sets Aj which 
contain x; i.e., 

k 
3=1 

Thus we would like to estimate 

□ 



□ 
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In what follows /x shall be a small exponent. We will frequently use expressions like N'^(^\ 
The exponent will be bounded by C/i for C a universal constant which varies from line to 
line in the paper. Wc illustrate this by the following version of the large families principle 
(this is the principle which says that most children belong to large families) which will be 
used extremely often in this paper. 

Lemma 2.3. Let Mi, . . . Mk > &e real numbers and let R > be a real number. Suppose 
that Mj < RNi^ for each j and suppose that 

K 

Mj > RKN-". 

Then there exists a subset J of {1, ... , K} with \ J\ > N~'^^>^^K so that for each j G J, we 
have Mj > N'^^t^'^R. 

Proof. Let 



J = {j: Mj > iV-iO'^i?}. 
Suppose that \J\ < N~^^^^K. Then by the upper bound on Mj, we have that 

^ Mj < N-^^'RK, 
jeJ 

while 

Mj < N-'^^i'RK. 

Combining these two estimates gives us a contradiction. □ 

We take a moment to state the asymmetric Balog-Szemeredi-Gowers theorem which we 
will have occasion to use. A set 5 c will be said to be /x-additively closed if 

\B - B\<N^^^'\B\. 

Lemma 2.4. Let B,C C so that there are at least N~^\B\\C\'^ additive quadruples of 
the form 

6l + Ci = 62 + C2 

with 61,62 £ B and ci,C2 G C. Let L = and assume L < N^^ . Then there exists 
fi depending only on r), with fj, as r] 0, and there exist an fx-additively closed set 
K C and a set X C F^ with 

\X\ < ivO(M)M 
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so that 

\Br\{X + K)\ > N-'^^'''>\B\, 
and an element x G so that 

\cn{x + K)\ > iv-^('')|C|. 

In particular, the last inequality implies 

\K\ > iV-^(^)|C|. 

An only slightly stronger form of this lemma appears in the book of Tao and Vu as Lemma 
2.35. |TV06j Another way of stating this result which we will use frequently is to define 

a function / with limj f{t) =0 and to let = /(??). We will use this kind of notation 

frequently in the paper with the choice of / varying from line to line. 

Finally, we record the form of Preiman's theorem which we shall use. 

Theorem 2.5. A fi additively closed set is contained in a subspace of dimension A^'^(^'). 

Various improvements of the finite characteristic Freiman's theorem have occured such as 
the result by Sanders |S08j but these only affect the constant in our formulation. Even 
Ruzsa's original version [R99] suffices. 

3 Review of Meshulam's argument, Fourier analysis in F^^ , 
and sparsity of the spectrum 

For the remainder of this paper A will be a subset of with |^| = p3^ » -^rj^ with 
e > to be determined later. Moreover A shall be a cap set meaning that it contains no 
lines. A line in F^ is characterized by being a set with exactly three distinct elements 
a, b, c satisfying a + b + c = 0. 

In this section we will establish some basic facts needed for our proof, and that are enough 
to obtain Meshulam's bound of ~ on the density of capsets. Further, we shall prove 
a statement of the form "The spectrum A does not have too much intersection with any 
small subspace." 

We will assume that there are no strong increments for A in the sense of Definition 11.31 
Precisely, we assume there is no hyperplane H so that ACi H has density > /o + ^ in H 
and no subspace H of codimension d < ^ so that ACiH has density > p + . We recall 
that a contradiction of this assumption will mean that every large cap set A has strong 
increments. This will contradict the existence of large cap sets. 
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We define the character e : Fy, — ^ C by e(0) = 1, e(l) = -\ + ^i, e(2) = -\ - ^i. We 
will study the Fourier transform of the set A, namely 

As a consequence of the assumption that A is a large cap set without strong increments, 
we shall see that there is a significant set A of a; for which is fairly large (the set A 

will be called the spectrum of A) and we shall see that A does not concentrate too much in 
any fairly low dimensional subspacc of . Our first nontrivial fact about A is that A — p 
has large norm, and moreover that this large norm is accounted for by the set of x 
where is large. Precisely: 

Definition 3.1. Define 

A = {x^O: \A{x)\>p'}. 

We shall refer to the set A as the spectrum of A and it shall be our central object of study 

for the remainder of the paper. 

Note that with this definition, A is symmetric, that is 

A = -A. 



It is worth noting that the following proposition is the only place in the paper where wc 
use the assumption that A is a capset specifically; the other parts of the paper use only 
the assumption that A has no strong increments. 
Proposition 3.2. If A is a capset, then 

Y.\A{xt>p\ 



Moreover, 



and 

< |A| < Ar^+^^ 

Note that this proposition is already enough to obtain Meshulam's estimate: in particular, 
it guarantees that the set A defined in the statement of the proposition is nonempty. This 
means there is at least one x such that > p^. This guarantees the existence of a 
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hyper plane P such that the density oi An P inside P is at least p + cp^. Taking p large 
enough compared to jj already contradicts the no-strong-increment hypothesis, yielding 
Meshulam's estimate. 

In what follows, we prove this proposition. We consider 

xeF^ xeF^ a&A feeA ceA 

Summing first in x, we see that this expression yields 3'"^^ mutiplied by the number of 
solutions of the equation a + 6 + c = with a, b, c taken from A. Since we have assumed 
that ^4 is a cap set, the only solutions occur when a = b = c. Thus 

^ A{xf = 3-2^1^1 = 3-^/>. 

However, we observe that A{0) = p. Thus, given the size of A, we see that p^ dominates 
3~^p and we conclude 

E 1^(^)1'^^'- (3-1) 

However, following the proof of Plancherel's inequality, we see that 

E 1^(^)1' < E 1^(^)1' = ^ E EE^((«-^)-^)- 

Summing first in x, we conclude 



x¥=0 x&Fi' x£Ff (leA f)6B 



^\Aix)\'<3-''\A\<p. (3.2) 

x^O 

By the assumption that A has no strong codimension 1 increment, we conclude 

\M-)\ ^J§7 = W (3-3) 

Recall that 

A = {x^O:\A{x)\>p% 

By selecting the implicit constant in the definition of A correctly, we see combining in- 
equalities 13.11 and 13.21 that 

E 1^(^)1'^^'- (3.4) 
xeA 
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Combining inequalities 13.31 and 13.41 we see that 

lAI > N^. 



Combining the definition of A with 13.21 we get 

|A| < iv3+3^ 

Now we prove a statement of the form "The spectrum A does not have too much intersection 
with any small subspace." Precisely: 

Proposition 3.3. Let A be a set without strong increments. Let A be the spectrum, as in 
Definition \3.1[ Then for any subspace W of having dimension d < we have 

|An VF| < div^+2^ 

Moreover for such a subspace W , we have the estimate 



Here we see what the assumption of no higher codimension strong increments implies about 
the spectrum A. Let iif be a subspace with codimension d < we let 1/ be a dimension 
d subspace which is transverse to H (i.e., V + H = F-^) and we let W be the annihilator 
space of H. Then for any ii; 7^ G W , we see that 

= ^Y.^\Ar^{H + v)\- 3- • w). 

Then we have 

\A{w)\^ = ?>'^-'^^"Y{\Ar\{H + v)\-?,-^\A\f. (3.5) 

By getting an upper bound on the right hand side of equation 13. 5^ we can obtain an upper 
bound on |A n W\, which is our goal. 

To estimate the right hand side, we subdivide V = U V~ ^ where 

V+ = {v:\Ar\{H + v)\- 3~Vl > 0} 

and 

V- = {v:\Ar\{H + v)\- ^~'^\A\ < 0} 
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We observe that since 

J2{\An{H + v)\-3-^\A\) = 0, 

we have 

\\An{H + v)\-3-'^\A\\= \\An{H + v)\-3-'^\A\\. (3.6) 
vev+ vev- 

We observe that the hypothesis that A has no strong increments impUes that for v £ , 
we have the estimate 

\\An{H + v)\-3-''\A\\<3-'\A\^. 
Thus simply using that \V\ = 3"^, we get the estimates 

Y \\An{H + v)\-3-''\A\\<\A\^ (3.7) 

v£V+ 

and 

J2 \\A n{H + v)\- 3-^11' <^-'\A\'^ (3.8) 

v£V+ 

Now for V G V~ , we have the trivial estimate 

\\An{H + v)\-3-'^\A\\ <3-Vl- 

In light of equation 13.61 and estimate 13.71 this yields 

J2 \\An{H + v)\-3-''\A\\'<3-'\A\'^ (3.9) 
vev- 

Combining inequalities 13.81 and 13.91 gives the estimate 

\\An{H + v)\ - 3-Vll^ < 3-Vl^|r- 

Thus equation 13.51 gives 
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However, we recall that if G A, we have that 

1 



\A{w)\>p' 



_/V2+2e ■ 

Thus we get the desired estimate: 

|A n w| < c^iv^+2^ 

4 Additive structure in the spectrum of large cap sets 

In this section we establish that the spectrum has some nontrivial additive structure. 

Specifically, we prove it has A^'^~'^(^) additive quadruples. 

Proposition 4.1. Let A he a large cap set. Let A he the spectrum of A and let A' he any 
symmetric subset of A with 

lA'l > i|A|. 

Let E/^iA!) he the numher of additive quadruplets xi+X2 = x^ + x^ with xi,X2, X3, X4 G A'. 
Then 

The argument for a major subset A' of A is no different, so for convenience of notation we 
assume in fact A' = A. 

We retain the notation of the previous section considering A the spectrum of a large cap 
set A. In particular, we have |^| >> j^rf^, we have 

< |A| < iV^+^^ 

we have for every x £ A, that > jv^' have that A is symmetric, namely 

A = -A. 

From the lower bound on |^(a;)|, we have for each x, an affine hyperplane Hx, annihilated 
by X so that 

l-4nff.|-lw>3''^ = i^. 

Summing over A, we obtain 
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We wish to rewrite this as a double sum by introducing Ih^, the indicator function of 

We interchange the order of the sums: 
Now we apply Holder's inequality: 

iA|}(EiE(i".(!')4'i'''Sw- 

y£A zeA 

Taking everything to the fourth power and simplifying, we get 

y£A xeA 

Crudely expanding the sum, we get the apparently weaker inequality 

|A||A|4 



We can rewrite this as 



E E nL,(iH.j,)-i)>^. (4.1) 

We claim the estimate 14.11 says that the spectrum A has substantial additive structure. 
This will be demonstrated by the following proposition. 

Proposition 4.2. Let xi,X2,x^, and X/^ he non-zero elements of . Then the expression 



y&Fi' 
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Moreover it vanishes unless an equality of the form 

zbxi ± X2 ± ± 3:4 = 

holds. 



Proof. We introduce the Fourier transforms of the balanced function of the hyperplanes 
H^^ setting 

Then we use the standard Fourier identity 

^ nLl(l//.j2/) - ^) = 3^ /l(^l)/2(^2)/3(^3)/4(^4). 
yeF^ 21+22+23+24=0 

We observe that fj{zj) vanishes unless zj = ±rcj. 

The upper bound on the sum just follows from the triangle inequality. 

□ 

To finish the proof of Proposition 14.11 we apply the inequality 14.11 the fact that |^| >> 

Q AT 

-jST+^, the proposition 14.21 and the fact that the spectrum A is symmetric. 



5 Random selection argument for additively smoothing spec- 
trum 

In this section we study the additive properties of random subsets of the spectrum. We will 
show that they typically have very poor additive structure. This will allow us to conclude 
that, although the spectrum has many 4-tuples, it cannot have too many 8-tuples. The 
significance of this will only be made clear in Section El 

We defined £'4(A) to be the number of additive quadruplets in A. We define E2m{^) to 
be the number of additive 2m-tuples 

X1+X2 + ...Xm = Xm+1 + Xm+2 + ■■■ X2m, 

such that xi,X2, ■ ■ ■ , X2m £ We let A(x) be the Fourier transform: 

xeA 
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Then 



which can be sen by expanding the sum on the right, as in the proof of Plancherel's theorem. 
We always have £'2 (A) = |A|. When we have nontrivial amounts of additive structure in 
the sense that say i?2fc(A) >> jAj'^, we can hft this up to counts of higher-tuplets using 
Holder's inequality. (We use the inequality to bound the 2A;-norm by the 2-norm and the 
2m-norm.) We can view this process as a poor man's Plunnecke theorem. We record this 
result for high £'4 and high E^. 



Lemma 5.1. Let m > 2. Then 



Suppose m > 3. Then 



|A 



m-2 



Es{A) 



m— 1 
~3~ 



, m — 4 

A — 



< ^2,n(A). 



< ^2,n(A). 



5.1 Discussion of additive smoothing 



We are now ready to introduce the notion of additive smoothing. We keep in mind two 
examples of kinds of sets having additive structure. One kind of set consists of a subspace 
plus a random set. The other consists of a random subset of a subspace. We think of 
the first kind of set as not being additively smoothing because as you add it to itself, its 
expansion rate stays essentially constant. This is the kind of example for which Lemma [5. II 
is close to sharp. But the second kind of set, when added to itself, will quickly fill out the 
subspace and its rate of additive expansion will shrink dramatically. The lack of additive 
structure smooths out under addition. This is the kind of example for which Lemma l5.ll 
is far from sharp. 

We will momentarily define A to be additively smoothing if £'8(A) is substantially larger 
than expected from Lemma 15.11 and our lower bound for E^^A) obtained in Section HI 
(Nonetheless, for the purposes of this paper, the gain in the exponent need only be 0(e).) 
We will define additive smoothing so that if A is additively smoothing then for some not 
very large m (depending only on e), we may expect to find additive m-tuplets of A in a 
randomly chosen set S of d elements. 

Before we formally define the property of additive smoothing, we illustrate how the calcu- 
lation works in the case e = 0. In that case d ^ N so that an element of A (which has size 
A^'^) is chosen with probability N~'^. We have a lower bound of A^^ on E4{A). Suppose 
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that we can improve on the lower bound of N^^ for Eq{A) which we get from the first part 
of Lemma 15.11 and in fact 

Es{A) > iV^^+^ 

for some S > 0. Then from the second part of Lemma |5. 11 we obtain the estimate 
Thus there is some m which depends only on S so that 

Thus the expected number of 2m-tuplets in S" is >> 1. We will formally define additive 
smoothing to achieve the same effect when e is different from 0. 



5.2 Nonsmoothing of the spectrum 

In this subsection we make rigorous the arguments of the last subsection. 

Definition 5.2. We define A to be additively smoothing if there is some a > 30e so that 

^8(A) » iVl5+<^. 

We are now in a position to state the main result of this section. 

Lemma 5.3. If A is the spectrum of a large cap set without strong increments then A is 
not additively smoothing. 

We begin with a few comments about our proof strategy in this section. If S* is a "random" 
subset of A, then we expect 

Thus we can show that £'2m(^) is small by showing that Es is small for a typical (somewhat 
large) subset 5 of A. 

Now we fix a particular number d and consider random subsets of A of size d. We will 
take 

d--N^-' (5.1) 

with the explicit constant to be determined later. Our first goal is to prove that we expect 
this subset to span a space of dimension d. More precisely: 

Definition 5.4. Let be a set of d vectors xi, . . . G . We say that the set S has 
nullity k if the dimension of the span of S is d — k. 

We will consider uniform random selections of sets of d elements from A. We can view 
these selections as d-fold repetitions of uniform selection without replacement. We will 
prove 
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Lemma 5.5. A random selection S of size d from the spectrum A has nullity at least k 
with probability < 2^'^. 

Proof. Once we have completed our first m choices, our selections vector 
space Wm with dimension no more than m. Thus |An Wm,| ^ mA^^"'"^'^ by Proposition 13.31 
We choose the constant in 15.11 so that the probability that the m + 1st element of S lies in 
Wm is bounded by ^ for all m < d — 1. Note that since m < d, this probability is bounded 
by 

CdN^+^' 
Cd 

iV2-2e 
1 

d 

provided d « N-^~''. Thus the probability that S has nullity at least k is bounded by the 
probability that for d independent events with probability ^ at least k occur. 

The probability that exactly k events from d independent events with probability ^ occur 
is exactly 

The numbers g{k, d) decrease by a factor of more than 2 as /c is increased by 1 as long as 
k > 2. This completes the proof of the lemma. □ 



\Anw„ 

lAI 



< 



< 



Now that we know our random subset is likely to have full rank, we estimate the number 
of 2m-tuples it contains in the case it does not have full rank. Given a set S with nul- 
lity k we will bound the number of possible additive 2m-tuplets between elements of S. 
Specifically: 

Lemma 5.6. A set S of size d and nullity k has E2m{S) ^ Cmk'^"^. 

Proof We write a list E of all equations among elements of S which involve 2m or fewer 
elements of S. Because the nullity is k, the span of these equations has dimension at most 
k. We pick a basis B for E and the equations in B involve at most 2mk elements of S. 
Thus all of the equations of E involve at most 2mk elements of S. Thus there are at most 

him,k)=2"^i2m)\(^^J^y 

additive 2m-tuplets from S. We refer to /i(m, k) as the number of possible m-tuplets in S. 
Note that h{m, k) is a polynomial of degree 2m in k. □ 
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Proof of Lemma \5.3\ . Now let 5 be a random selection of d elements from A. Then by 
Lemma 15.51 the probability that S has nullity A; is < 2^^^. Thus the expected value of the 
number of possible 2m-tuples Ylk>o h{m, k) is Ylik>Q '^~^k'^^ < 1. 

Now we will show that we have defined additive smoothing so that the expected number 
of 2m-tuples is >> 1. This will give us a contradiction. 

We know that d > N^'^. Thus our selection S will be expected to have >> 1 non-trivial 
2m-tuples, whenever £^2m(^) >> A^^™+2™^. (|We simply calculate the probability that an 
individual 2m-tuple involves only elements of S.) Thus we may assume that 



Using the fact that |A| < A^^+Se g^j^^ ^j^^ second part of Lemma |5.H we get that 

15m + 27me — 1 — e 

Es{A)<N . 
Choosing m sufficiently large gives 

Es{A) < iVi5+27.. 
The choice of m and hence the constants depend on e but not on N. 

□ 



6 Structure of robust additively non-smoothing sets 



In this section, the only properties of the spectrum A which we shall use are its size, its 
additive structure, and its non-additive smoothing. Consequently the results can be stated 
in somewhat more generality. We leave intact, however, the numerology coming from the 
case of spectrum of cap sets. 

We will say that a symmetric set A C is a robust additively non-smoothing set of 
strength 6 provided that we know its size: 

< |A| < N^+' (6.1) 

that we know how many additive quadruples can be made from any large subset of it, 
namely that if A' C A with |A'| > ||A| and A' symmetric, we have 

E^{A') > N'-^ (6.2) 

and that we have additive non-smoothing, namely 

Es{A) < N''+' (6.3) 
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and moreover that for each element a E A, there are at most A^^+^ quadruples of the 
form 

±a±b = ±c±d 

with b,c,d£ A. 

Let us pause to consider the case of A, the spectrum of a cap set with no strong increments. 
We know that the number of a E A participating in more than A^^+*^("^) quadruplets 

±a±b = ±c±d 

is smaller than since otherwise A would have more quadruples and hence more 

octuples than allowed by Lemma l5.3[ Let A' be the remaining elements of A. Note that 
by its definition A' is still symmetric. Note that any symmetric subset of A' containing at 
least three fifths of its elements must contain at least half the elements of A. Thus from 
Proposition 13. 2| Proposition 14.11 and Lemma 15.31 we know that: 

Proposition 6.1. Let A be the spectrum of a large capset with no strong increments. 
There is a subset A' of A so that A' is a robust additively non-smoothing set of strength 
0{e). 

Returning to the setting of robust additively non-smoothing sets, we let, for the remainder 
of the section, the set A be a robust additively non-smoothing set of strength 5. 

Given a value x E A — A, we define m{x) to be the number of pairs (a, 6) E A x A so that 
a — b = X. Clearly we have 

E^iA) = m{xf. 
xgA-A 

Given a robust additively non-smoothing set A of strength 5, for each a, we may define 
C A X A, by 

Ga={{a,b) € Ax A: iVi+" < m{a, b) < 2iVi+"}. 
By the dyadic pigeonhole principle, there is an a so that 

j^j-G—a—S 



\Ga\> 



loe-N 



Moreover, we know that no a in A participates in more that N^'^^ quadruples. Thus no 
element a in A participates in more than Jv3-"+'5 pairs in Ga- Thus there are at least 
~ ^^gj^ elements of A each of which participate in at least ~ ^LgTV^'^ pairs in Ga , by the 
large families principle (Lemma l2.3p . 

We now forget about optimizing our exponents and consolidate this information in a single 
definition. 



19 



Definition 6.2. We say that (A, G, D) is an additive structure at height a with ambiguity 
7] if the following hold. We have 

|A| < iV^+''. 

We have 

G C A X A 

with the property that for each (a, 6) G G we have that a — b £ D, and so that each 
d £ D has ~ A^i+° representations as a difference of a pair in G. We have |G| ~ 
Moreover there are at least N^~^ elements of A participating in at least A^^-a-?? g^j^is 
each. Finally there are no more than N^^'^^ additive octuples among elements of A. 

We summarize what we have shown so far in a proposition. 

Proposition 6.3. Given a robust additively non-smoothing set A of strength 5 we may 
find G C A X A, and D C A — A and a > so that (A, G, D) is an additive structure at 
height a and ambiguity 0{6). 

We now describe a slightly deeper property of additive structures at height a and ambiguity 
r]. Given a structure {A,G,D), for each x £ D, we define the set Ag'[x] to be the set of 
a S A so that there exists b £ A with (a, b) £ G and a — b = x. In light of our definitions, 
we have for each x £ D that |Ag'[x]| ~ N^^'^. We consider the quantity 

K{A,G,D) = Y.Y1 n AG[y]|. (6.4) 

Clearly K{A,G, D) counts the number of triples {a,x,y) with a £ Ag[x] and a £ Acly]- 
Each element in a is contained in exactly as many sets Ag[x] as it participates (in the first 
position) in pairs in G. Thus we conclude that for (A, G, D) an additive structure with 
height a and ambiguity rj that 

K{A,G,D) = Y^\{x£D:a£ Ag[x]}\^ > N^'^'^-'^ . 

aeA 

Now examining the equation (16.41) and dyadically pigeonholing, we observe that we can 
find P so that there are at least ]\i^-^"'-l^-'^v pairs (x, y) so that for each such pair, we have 
iV^ < \Ag[x] n AgM < 2N^. 

Definition 6.4. We say that the additive structure (A, G, D) at height a and with ambi- 
guity r] has comity /i if we can find the abovementioned (3 with /? > 1 + a — /x. (Note that 
of course /3 < 1 + a because \Ag[x] D AgMI < |Ag[x]| < N^+"'.) 

Lemma 6.5. Given an additive structure (A, G, D) at height a and with ambiguity rj either 
it has comity n or there is an additive structure (A, G', D') with height /3 — 1 < a — /i and 
ambiguity 0{r]). 
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We remark that this lemma contains the key use of the nonsmoothing hypothesis, which 
is hidden in the definition of "additive structure" . 

Proof. We dyadicahy pigeonhole the equation ()6.4p to find /3 so that there is a set of at least 
pairs (x, y) so that for each such pair, we have N^^ < \Ag[x] n AgM < 2N^ . 
If it happens that /3 — 1 > a — fi, then we are done. Otherwise, we will construct an additive 
structure at height /3 — 1. 

Now any time that a £ Aq [x] H Aq [y] , this means that we can write x = a — b and y = a — c. 
Thus we have x — y = c — b. We have between A^^ and 2N^ representations of the difference 
X — y. It remains to determine how many such differences there are. 

We have two distinct upper bounds on the number of such differences. First there are 
< ]\[G-l3+2v^ since each difference is represented by ~ A^'^ pairs in A x A and there are 
only such pairs. The second estimate is that there are < ]\i'^-'^l^+0(v) many such 

differences, because otherwise E4{A) would be much larger than A^^ which would make 
^8 (A) larger than N^^+'^. The first upper bound is most effective (ignoring ambiguity) 
when /3 < 1 while the second is most effective when /3 > 1. Our plan (modulo ambiguity) 
is that we shall rule out the case /3 < 1 and that we shall show that the second upper 
bound is tight up to a factor of N^^'^\ Both estimates will follow from the upper bound 
on E^{A) and the Cauchy-Schwarz inequality, namely Lemma l2. 11 

Since we have N^-'^°'~/^-0{'n) pairs (x, y) with at most A^6-/3+o(j7) differences, by the 
Cauchy-Schwarz inequality, there must be at least ]\[^'^-'^°'-P-0{'n) additive quadruples in 
D, namely x — y = x' — y' . (Here we let S be the set of pairs {x,y), we let T be the set 
of differences with ~ representations as difference of A and we let p be the difference 
map, p{x,y) = x — y. Then we can apply Lemma |2.1[ ) However since each difference x,y 
can be represented in A^^+'^ ways as a difference in A, we can represent each quadruple 
in D as an octuple in A in N^+^°' ways. Thus there are at least A^i6~^~'^('?) many such 
octuples which implies 13 > 1 — 0{ri). 

Thus we are in the regime where the estimate that there are at most A^'^^ 2^+0 (77) many 
differences is most effective. Suppose that there were only A^7-2/3-7 

many such differ- 
ences with 7 >> r/ Then apply Cauchy-Schwarz again, we would see that there are at 
least _/\rii-4«-0(r;)+7 mg^j^y quadruples in D which implies N^^~^^'^^~^"' octuples in A, a 
contradiction with the nonsmoothing hypothesis in the definition of "additive structure" . 

Thus taking D' to be the differences x — y obtained from {x,y) so that A^^ < |A[x] n 
A[y]\ < 2N^ and taking G' to consist of representatives of these differences coming from 
the intersections (as in the second paragraph of this proof), we obtain an additive structure 
(A, G', D') with height /3 — 1 < a — /i and ambiguity 0(r/). 

□ 
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Corollary 6.6. Given an additive structure {A,G,D) at height a and with ambiguity r] 

there is an additve structure {A,G,D) at height a' < a with ambiguity /i and comity /i 

with a < T-ir ■ 
^ ^ log i 

Proof. We iteratively apply Lemma 16.51 with comity /i fixed by 

K 



log J 



with K a large constant, and with the ambiguity increasing by a constant factor C in 
each iteration. Since a decreases by fi each time we don't find comity we need only ~ ^ 

iterations to achieve comity. At this point, we have ambiguity given by C k t] « /j,, as 
long as K was chosen sufficiently large. □ 

Now we begin to investigate what we can say about the shape of the set H of all pairs (6, c) 
in A X A having the property that b — c has at least A^i+"~'^(^) representations in A x A for 
(A, G, D) an additive structure with height a and ambiguity and comity /x. We will find 
that the set H is rather thick in a product set whose projection has size A^3-Q!-0(^t)_ 
Lemma 6.7. Let (A, G, D) be an additive structure with height a and ambiguity and comity 
11. Then there is a subset B C A with \B\ > A^3-a-0{/i) ^/^^^ there is a set H C B x B 
with 

so that for any (6, c) G H , the difference b — c has N^+"~^(t^) representations in A x A. 
Proof. From the hypotheses, we have that 

E \^g[x] n AgM > iv9-2"-o(M), 

and that there are at least N^~^"^^^^^ pairs (x, y) for which 

Using the pigeonhole principle, we fix one value of x for which there are 

N^-^-^M choices 

of y so that 

|AG[x]nAG[y]| >iVi+"-^(^). 

Again using the pigeonhole principle, we find an a G A and a set y C -D so that a G Aciy] 
for every for every y gY, so that |y| = A^3-Q-0(At) ^jjg^^ fg^ each y gY, we have 



|AG[x]nAG[y]| >iV^+°-^(^). 
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We notice that by definition a — Y C A. We choose B = a — Y Finally since each AG[y] 
has relative density A^~'^(^) in Ag'[x], we have by Cauchy-Schwarz (Lemma I2.2p that 

^ \AG[y]nAG[y']\ > |y|2Ari+-oM. 

y&Y y'eY 

This implies that B satisfies the conclusion of the lemma. The reason is that by Lemma 
\T3[ we have a set Y of pairs y,y' so that |A[y] n A[y']\ > N^+»-OM ^ xhis implies that 
a — y' — {a — y) = y — y' has at least N^+°'^0{^l) representatives as a difference of two 
elements of A. □ 

Now we are going to use Lemma 16.71 repeatedly to show that for any robust additively 
non-smoothing set of size 5 we can find an additive structure of ambiguity ry with 77 < --^ 

which breaks into dense blocks. 

Lemma 6.8. Let A he a robust additively non- smoothing set of strength 5. Choose fj, ~ 
j~r- Then for some < a < 1, there is an additive structure {A,G,D) of height a and 

ambiguity /i and disjoint subsets Bi, . . . Bx of A with each Bj satisfying \Bj\ < iv3-a+0(/j) 
so that 

K 

G<l[jB,xB,. 

i=i 

Note that since we are requiring that {A,G,D) be an additive structure, this requires \G\ > 
jsi6-a~o{ii) ^^^^^ implies K > m^^^f) . 

Proof. Using Proposition 16.31 , Corollarv 16.61 Lemma 16.71 We can find a subset Bi of 
A so that for some choice of a, it has size at most A^3-a+0(/x) ^^^^ nevertheless Bi x Bi 
contains at least A^6-2a-0(/^) pg^jrg whose differences have A^i+"^'^(^) representations as 
differences in A. 

Having done this, we use the robustness property of A to apply the same argument to 
A\(i?i U —Bi). We continue removing sets from A until we have exhausted half of A. Now 
one difficulty is that the disjoint sets B which we chose do not all have the same a. We 
use dyadic pigeonholing to resolve this for only a small cost in the number of sets. We 
call these sets Bi, . . . , Bk- Now Lemma 16.71 guarantees us in each Bj x Bj, a subset Hj 
of cardinality at least A^6-2a-0(At) n^g^^ each difference in Hj is represented in A x A at 
least A^^+"~*^(/^) times. We denote by Da the set of differences represented in A x A at 
least N^+°'~^(t^') times, and note that 

I n I < /v5"2Q:+0(At) 

lest there be enough quadruples in A to violate the additive non-smoothing condition. 
Using the large families principle and pigeonholing, we find some a' > a — 0{^) so that 
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at least N^~'^°^~^(^^) differences are represented at least N'^^"'' many times in DjHj. We 
denote this set of differences as D'^. We let D be D'^ and let G be a subset of UjHj 
consisting of A^^+" representatives of each difference in D. 

□ 

Our goal now will be to use will be to use Lemma 16.81 to find almost additively closed 
sets E of size at least N^~^^^^ inside robust non-additively smoothing sets of strength 6. 

Here / : [0,1] — > [0,oo) is some function with limj s>o/(^) = 0. We will be employing 

such functions from now on in the paper. They, like constants, will change from line to 
line. 

The project of finding additively closed sets will be easiest when we have additive structures 
of height essentially zero having ambiguity and comity /u. For this reason, we are about 
to define a stylized structure which generalizes this situation. We will eventually use the 
generalized version, replacing A with the blocks Bj. 

We will now define a //-full stylized /9-structure which is r-energetic and has ambiguity 
and comity /i. (The error exponents fi are all the same.) This will be a set {A',G,D) 
where 

|A'| ~ NP, 

(hence a p-structure) where G C A' x A' with 

|G| > Ar2p-o(M), 

(this was the /x-fullness), where D is the set of differences in pairs in G and each difference 
represented > N'^ and < JSf^+Oifj.) ^jj-^gg^ hence r-energetic. Finally we assume that there 
are at least N^P-'^-O(ii) p^jj-g (^x,y) £ D x D so that 

|A'[x]nA'[y]| >iV--OM, 

which is of course the /x-comity. 

We shall say that a set K is /i-additively closed provided that 

\K -K\ < Af^(^)|K|, 

as in Section [21 

Lemma 6.9. There is a function f : [0, 1] — > [0, oo) with 

lim f(t) = 0, 

so that the following holds. Let (A',G,D) be a fi-full stylized p-structure which is r- 
energetic and has ambiguity and comity /x. Then there is an /(/x) additively closed set K 
with 

\K\ > Ar^"-^('^) 
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and a set X so that 

\X\ <iV/M'^'' 



so that 

\A'nix + K)\ >Nf--f^^l 

Proof. We proceed essentially as in the proof of Lemma 16.71 We find x £ D so that there 
is a set y of y e with \Y\ > ATP-OC/^) so that 

|A'G[x]nA'c[y]| >iV-0(^), 

for every y £Y. As before, we use the pigeonhole principle to find a G A' so that there is 
a subset Ya of Y so that for each y £ Ya, we have 

a G A'aiy] 

and so that 

However Ya C a — A'. Thus we think of 1^ as a dense part of a translate of —A'. Now we 
know that 

Y^\A'a[x]nA'a[y]\>N^^^-oi^^l 
y&Ya 

This precisely means that there are A^^+'^^'^('^) triples (6, c, d) with 6, c G A' and d G A'[x] 
with a — b = d — c with a still fixed. Applying Cauchy-Schwarz (see Lemma I2.ip we find 
jyp+2T~o{fj.) quadruples {d,c,d',c') with d,d' G A'[x] and c, c' G A'. As it happens, this 
is precisely the hypothesis of the asymmetric Balog-Szemeredi-Gowers theorem (Theorem 
I2.4p applied to A' and A^lx]. The conclusion follows directly. 

□ 

We are now prepared to state the main result of this section. 

Theorem 6.10. Let A be a robust additively non- smoothing set of strength 5. As before 
choose ji ~ -j — —. There is f '. [0, 1] — y [0,oo) with 

^ 6 

lim f(t) = 0, 

so that for some 7 > 0, there is an f{fi)- additively closed set K with 

\K\ > iyi+T-ZM 
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contained in A. In the event that we must have 7 = 0{f{ji)), for some < a < 1 , we 
may find pairwise disjoint subsets Bi, . . . ,Bm C A with M > _/V<^^0(^) ^^^^ each 
integer 1 < j < m, we have 

J^^T3-a-0{^J.) < in. I < ArS-a+Oip) 

and moreover we find for each j a fi-additively closed set Kj with 

7Y1-/(a') <\Kj\< Afi+ZM, 

together with a set Xj with 

J^2-a-f{^l) < i-v-.i < a72-q+/(m) 

SO that 

Further, there is a set D with \D\ > A^5-2a-/(^) ^^^^ eac/i element of D has at least 
j\ri+o-/(M) representations as a difference of elements of IS. and so that for each j, the set 
of 4-tuples Qj = {{ki, di, A;2, ^2) : ki, k2 £ Kj , di, d2 £ D : ki — di = k2 — ^2} satisfies 

Moreover we may choose Kj to be contained in the set of differences having at least 
jy5~2a-f{ij,) representations as a difference between elements of D. 

Proof. We apply Lemma 16.81 and restrict our attention to 

K 

Gc[jB,x Bj 

i=i 

where the Bj are the blocks obtained there. Now to G, we apply the argument used in the 
proof of Lemma 16.51 

That is, for any element x G —(G), we study 

Ag[x] = {a e a : 3b e a : {a,b) e G; a-b = x}. 
Here —(G) = {a — b: {a,b) £ G}. Then, we observe 

|AGNnAc[y]|>iV9-2"-^('^\ 

x,ye-{G) 

and we observe as before that there is some l + a>/3>l — 0(;u) so that there are at 
least Ar9-2<^-/3-0(M) pairs {x,y) for which |Ag'[x] n AG'[y]| > N^. We note that when this 
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happens, for each a in the intersection Ag[x] n AGr[y], we have x = a — bi and y = a — b2. 
Here if a € Bj, we must have hi G Bj and 62 G Bj, since (a, (a, 62) S G. We argue 
as in the proof of Lemma 16.51 that there are at least A^''~2/3-0(/^) (differences having A^^ 
representations in 

K 

U X 

If not, ^8(A) >> N^^+^. But 

^/3^7-2/3-0(m) < 

which imphes 7 — f3<6 — a + 0{fi), which imphes 1 + a — 0{fi) < (3. This gives us 0(/i) 
comity for G. 

Thus we have a set if of pairs x,y for which |AG[rE] n AG[y]| > jv^^"^*^'-'^-' and so that 

we use the pairwise disjointness of the blocks Bj to write the 

identity 

K 

\AG[x]nAG[y]\ = Yl Yl |AGNnAGb]ni3,|>iv9-2"-o(^) (6.5) 

{x,y)eH j=l {x,y)eH 

Now we begin to use the comity of G. We first eliminate from the second sum in equation 
16.51 all terms for which the relative density of Aq [x] in Aq [y] H Bj or the relative density 
of AG'[y] in Ag[x] n Bj is smaller than N~^'^ for too large a constant G, again using the 
large families principle. By choosing G sufficiently large we do not reduce the sum by a 
factor of more than 2. We dyadically pigeonhole to obtain the largest possible sum from 
those terms where N^'^'^ < \Ag[x] n AG[y] Bj\ < 2N'^~^^ . (We denote this set of (x, y) as 
H^j), Thus we have reduced the sum by at most a factor of log A^. We keep only those j 
for which 

Y \Ag[x] n Aaiy] n B,\ > n'-^"-o(p\ (6.6) 

which we can do without sacrificing much by using again the large families principle. 

We observe that for each x, there are at most A^3-a+0(/i) gjioices of y so that {x, y) G H. 
The reason is that any a G A belongs to at most A/^3-a+0(/i) gg^g AG'[y] because G is 
contained in Uj-Bj x Bj. However if there were more than A^3-a+0(Ai) ^.j^iQ^^gg Qf y gQ that 
(x, y) & H then there would be elements a G AG[a;] which are contained in AG'[y] for more 
than A^3-a+0(^) gj^oices of y Thus, since we have at least ]\[^-'^°'-"/-0{ij-) triples (j, x, y) for 
which 

\AG[x]r\AG[y]r\Bj\>N^+\ 



K 

u 



Bj X B 
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while at the same time 

\AG[x]nBj\ <iV^+7+OM^ 

(by the relative density of Acbj] in ^ci^;] H it must be that for most values of j, we 
have at least N^~^°^~'y pairs G H^j. This means, fixing one such value of j (since at 

most A^3~"+'^(^) values of y are paired with a given x) , there are at least A'^5~2a-7-0(/i) 
differences x with A^^+t representations in G n {Bj x Bj). We call this set Dj^^. 

Thus Gn (i?j X Sj) is //-full and 1 + 7-energetic. Another way of describing the /i-fullness 
is that A^^^2a-7-0(/i) |-^p jyO(^t) factors) is the largest number of such x possible, purely 
based on the size of Bj x Bj. Thus it must be that for a set of size A^5~2«-7-0(^t) j^^any such 
X, there are again, by the large families principle). 

Thus iBj,G n {Bj X Bj),Dj^^) has 0(/i)-comity. Clearly {Bj,G D {Bj x Bj),Dj^^) is a 
3 — a structure with ambiguity Thus we are in a position to apply Lemma 16.91 This 
proves the first part of the theorem. Indeed, since all our estimates were optimal up to 
N^^'^'i factors, there is a set J of choices of j for which we could apply Lemma 16.91 with 

|j| > ^a-O(M). 

To prove the second part, we consider in detail the case 7 = 0. We will apply the argument 
proving Lemma 16.91 to all j G J . This will give us //-additive sets Kj and sets Xj with 
appropriate upper and lower bounds since we can assume A contains no /t-additive sets 
with more than A^i+/(/^) elements. 

We will allow / to vary from line to line and we will express even quantities that are clearly 
0{^i) as /(/t). 

We let D be the set of all differences x for which \/^[x\r\Bj\ > N'^-f(^') for at least iV°-/(^) 
values of j G J. For each value of j G J , there are at least A^s~'^°~/(/^) pairs {x,y) G 
with \Ag[x] n Aciy] n Bj\ > N^-f(^'\ Note that we may also restrict D to differences 
which cannot be represented in more than N^+fM ways as differences of elements of Bj 
for more than A^"^/(^) values in j and so that in each Bj our count of good pairs (x, y) 
consists only of pairs of differences which cannot be represent as differences in Bj in more 
than iVi+/M ways. Otherwise, we could choose 7 > /(/i). 

Now we recall the structure of the argument in Lemma [6.91 We chose an a G Bj and a set Ba 
of size N^~°'~f^^^^ of the differences in which a participates, and a set Ka which is actually 
of the form AclxjO Bj and has size N^~^^^\ We find A^^^"^/(^') additive quadruples made 
up of two elements of Ba and two elements of Ka- We may strip down Ka further to those 
elements which participate in at least A^^~"~/(/^) of these quadruples and not harm our 
estimate on the number of quadruples between Ka and Ba- Now, we note that since Ba is 
a large subset of a translate of Bj, it must be that there are A^2^^(^) pairs ((71,(72) G Ka 
with the property that qi — q2 is represented N^~'^~f(^^) times as a difference of elements 
of Bj. We let Kij be the set of differences of Bj that can be represented in A^3-a-/(M) 
ways as differences in Bj. Because Bj contains no /t-additively closed set of size more 
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than N^+f(^'\ we have that < Ar^+-^('^). Otherwise we could apply the asymmetric 

Balog-Szemeredi-Gowers theorem, to obtain a /x-additively closed set, contained in Bj, as 
in the 7 >> 0{f{iJ,)) case. 

We can replace N^~°'~f(^^') quadruples qi — q2 = xi — X2 with qi,q2 G Kij and xi,X2 G Ba 
by an equation of the form q = xi — X2 with multiplicity at most A^^+/(^). Thus we obtain 
at least A''^~°~/(^) such equations. We rewrite the equation as xi — q = X2 and use Cauchy 
Schwarz to obtain A?^5-a-/(Ai) quadruples xi — q = x'l — q' . We see then that without losing 
more than A^-^(^) factors in our estimates, we can replace Ka by i^ij. This is good since 
we have made it independent of the choice of a. Now we need only show that we can find 
many quadruples not only between Kij and Ba but between Kij and D. This will give 
us the desired result. 

To do this, we observe that we may delete from Ba a set with relative density A?^~-^(^) 
without harming our estimates on the number of quadruplets between Ba and Kij. Our 
goal will be to cover a subset D' by a disjoint union of subsets of the form B'a where B'a is 
a subset of Ba with relative density 1 — N~'f^^^^ . To do this we observe that for any fixed 
ai G Bj 

We can do this because the sum counts triples (x, 01,02) with x a difference and 0,02 arc 
parts of representations of it. We have assumed that we are only dealing with differences 
with fewer than A^i+*^(/*) representations. Here we are using that we are in the 7 = case. 

Now we produce D' as follows. We choose ai and keep the set Ba^ ■ We add all elements of 
Bai to D' We choose to have the minimal possible sized intersection with Ba^ and let 
B'a^ be those elements in Ba2 that are not already in D' . We choose Ba^ to have minimal 
possible intersection with Ba^ U Ba2 ■ We continue in this way until we reach a A; so that 
B'a^ no longer has relative density 1 — N~-^^'^^ in Ba,,. Because the average intersection 
\BanBa> \ is bounded by N^+f(^'\ we get that k is at least Ar2-"-/(/^). Thus our set D' has 
relative density at least N~f^^^'> in D. Thus we have that Kij has N'^~^'^~f(t^) quadruples 
with D' and a fortiori with D. 

Now we slightly refine Ki_j to K2J consisting only to differences of elements of Kij which 
participate in at least A^5^"^/(^) of the quadruples with D'. (We perform this refinement in 
order to prove the very last claim of the theorem.) Since D' is a disjoint union of sets with 
relative density N'f^^^'' inside translates of Bj, it must be that K2J still has N^~°'~f(^^^ 
quadruples with Bj . Thus we can apply the asymmetric Balog Szemeredi Gowers theorem 
to find a subset Kj satisfying the conclusions of the theorem. 

□ 
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7 Structure of spectrum of large capsets with no strong in- 
crements 



In this section, we transfer the results obtained in Theorem 16.101 over to the setting of the 
spectrum of large capsets with no strong increments. This turns out to be rather simple. 
The main ideas which we have not yet taken advantage of are Freiman's theorem and the 
use of the estimate in Proposition 13. 31 which bounds the number of elements of the spectrum 
in a subspace of dimension d by dN^'^'^''. 

We state the main result of the section. 

Theorem 7.1. Let A be the spectrum of large capset without strong increments. There is 

a function f : [0, 1] — > [0, oo] with lim^ >o fit) = so that the following holds. There is a 

subspace H of of dimension N^^^^ and a set A C F-^ of size N'^~f(^^ so that for each 
element A G A, there is a subset Hx C H with the properties that 

I^aI >ivi-/(^), 

and the sets A + Hx are pairwise disjoint subsets of A. For any subspace W C F^ of 
dimension d, we have that W contains at most dN^^^^ elements of A. 

Proof. As before we allow our function / to vary from line to line until we achieve the 
desired result. 

In light of Theorem 12.51 and Proposition 13. 3| any /(e)-additively closed set in which the 
spectrum has A^~./^(^) relative density, must be bounded in size by N^+f^^\ Therefore, we 
are in the 7 = case of Theorem 16.101 We know that there is a set A' of density A^~-^(^) 
in the spectrum A which is contained in 

M 

i=i 

with M > A^'^~'^(/('^)), with each Kj an /(e)-additively closed set (of size at least A^^~^(^) 
and at most A^-^+-^('^)) and with each set Xj of size N'^~"^f^^^ . Moreover, each set Kj 
lies in the set K of differences having at least A^5~2a-/(e) representations as differences 
of elements of D, the differences among elements of the spectrum which have at least 
jVi+a-/(e) representations. In light of the non-additive smoothing property of A, we have 
that \K\ < N^+fi') since there can be at most A^^^ 4a+/(e) quadruplets among elements 
of D. We may eliminate all elements q of each Kj for which A D q + Xj does not have 
size at least Af2-a-/(e)_ 

we let K' be the set of elements of K which appear in at 
least N"~^^^^ many Kj. We can find some Kj which has intersection of size N^~^^^^ with 
K' . Then K' D Kj = K" is a /(e)-additively closed set with cardinality at least N^~^^''\ 
Moreover each element of K" is contained in A^"~.^('^) many Kj. Thus by pigeonholing 
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there are at least A^"^/W many Kj so that \Kjr\K"\ > N^^-f^^\ We only keep these j and 
replace Kj by Kj n K". But by Theorem 12.51 we have that K" is contained in a subspace 
of dimension N-^^"^^ which we call H. 

This basically proves the first part of the theorem. We have that a subset of the spectrum 
of density N~-f^'^^ is contained in K" + X, where X is the union of the Xj^s. We will pick 
A and the sets Hx as follows: Find xi in X so that at least A^^~-^(^) elements of A are 
contained in xi + K". Let Ai be the elements of A contained in X + K" but not in xi + K" . 
Let A^ be those elements of A contained in xi + K" and let Hx^ = A^ — xi. Note that Hx^ 
is contained in K" and therefore in H. Now we proceed iteratively. Find Xj G X so that 
there are at least A^^~/(^) elements of Aj_i in xj + K" . When this is no longer possible, 
we terminate the process. Then we let Aj be the elements of Aj_i not in Xj + K" and we 
let A-' be the ones that are. We let H^^ = A-' — Xj. We let A = {xj} after the iteration 
has terminated. Note that if 

\{xj + K")r\^j-i\ «N^'f^'^ 

for all remaining Xj in X, then 

\(}{x, + K")\>N'^-f^^\ 

i=l 

so |A| >iV2-/(.)_ 

To prove the second part of the theorem, let S be any subset of A with some cardinality 
M. But S + H contains at least MN^~f'^''^ elements of A. This contradicts Proposition 
13.31 unless the span of S has dimension at least MN^^''\ 

□ 

8 Contradiction 

The goal of this section is to obtain a contradiction from the existence of large capsets 
without strong increments by using the result of Theorem 17. li We begin by recording 
some easy consequences of Plancherel's identity for the interaction between the Fourier 
transform of the characteristic function of a set and the Fourier transforms of its fibers 
over a subspace. 

For any set A C , we define its Fourier transform 

Mx) = -^^eia-x). 

We state Plancherel's identity: 
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Proposition 8.1. 

y: ii(x)p=3-^i^i. 

We let H he a subspace of and we let H-^ be its annihilator. We let F be a subspace 
of the same dimension as H which is transverse to H-^ i.e., V + = . We define the 
fiber Ah^v for u G F by 

Ah,v = Ar\{H^ + v). 

If we have h G H, then 

Thus we arrive at another form of Plancherel: 
Proposition 8.2. 

^|i(/i)P = |F|3-2^^|AH,.|^ 

Moreover 

Next, we consider the situation where we have a subspace H C F^ and a larger subspace 
K with H C K C F^ . We let y be a subspace transverse to H-^ as before and we would 
like to consider the Fourier transforms of the fibers of A, namely the sets Ah,v We can 
think of each fiber Ah,v as being identified with a subset of H-^ (by translation by v) and of 
course H can be identified with F^/H. That is, we define functions Ah^^ : F^ /H — > C 
by 

Ah,v{w) = ^ Y e{{a-v)-w). 

The function Ah^^{w) is well defined on F^\H since a — w is in . Next, we write down 
a version of Proposition 18. 21 which shows how the L? norms of the Fourier transforms of the 
fibers on K\H with the norm of the Fourier transform on K. We let W he a. subspace 
transverse to with V C W. 

Proposition 8.3. With H , K , V , and W as above, 
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Proof. Since clearly we have K-^ C H^, there is a unique subspace W C W with W' + 
/ST-L = iJ-L. We have F + W"' = W". 

We consider the following function on W, 

\A\ 

g{w) = \AK,y,\-j^^. 

Clearly, in light of the second part of Proposition 18.21 the left hand side of the identity we 
are trying to prove is the normalized square of the norm of the function g{w). 

We now break up g as the sum of a function g^ which is constant on translates of W 
and functions g^ with v running over V having mean zero and supported on the translate 
V + W' of W . Clearly the functions go and {gv} are pairwise orthogonal. The first term 
on the right hand side of the identity is the normalized square L? norm of the function 
go. The second term on the right hand side represents the sum over v of the normalized 
square LP' norm of the functions g^ . The identity is then an application of the Pythagorean 
theorem. □ 



(We remark that Proposition 18.31 can simply be thought of as Plancherel for a "local Fourier 
transform" of A. Here, we localize to the translates of H^. 

Now we are prepared to apply Proposition 18.31 to the setting in which A is large capset 
without strong increments and H is the subspace given to us by Theorem 17.11 

We let ^ be a large capset with no strong increments. As usual, / will be a function taking 
[0, 1] to [0, cxd] with lim^ f{t) = 0. We will vary / from line to line. 

Then there is a subspace H with dimension N^^'"'^ and a set A of size A^2~-^(^) so that for 
each A G A there is a subset H\ of H, so that \H\\ > A^^~"-^(^) so that for each h E we 
have that 

\A(h + X)\>N-^-f'^'h~^\A\. 
We also have that the sets A + H\ are pairwise disjoint. 
Note therefore that 

^ ^ |i(/i + A)|2> Ar-/WiV3-2^|^|2> Ar-/W3-iV|^|. 



Thus the structured elements of the spectrum of A account for a large proportion of the 
squared Lp norm of the Fourier transform of A. 

Now we would like to consider the fibers Ah^v, where H is the subspace we've been dis- 
cussing. Because the capset A has no strong increments, we know that for each value v, 
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we have 

\AH..\<i\^)il + Nf^^^-'). 
\ii I 

We will now momentarily fix the function /. 

However, we don't have a good lower bound on |^_f/,t)| in general. All we know is that sum 
of all the positive increments is equal to the sum of all the negative increments. (See the 
proof of Proposition 13.31 ) We let Vbad be the set of all v G V for which 

I^H,.|<(^)(l-iV^^-^). 

(That is Vbad is the set of those v €V for the which the fiber has a bad negative increment.) 
We know that 

\Vbad\ < N-Vm\v\. 

We define 

Abad = Ah,v, 

and we let 

A' = A\Abad- 

We know that 

\Abad\<N-Vm\A\. 

Thus 

E\^badix)\'<N-'^Yl E \Mh + x)\'. 

X AeA heHx 

Thus removing Abad does not perturb the large spectrum of A too much. 

(In making this precise, we now resume changing the function / from line to line, observing 
that we may take the next / to be larger than the previous vT-) We may find a set A' 
which is a subset of A with |A'| > A^2-/{e) ^j^^t for each A G A' there is a subset H'^ 

of H so that for each A G A' and each h G H'^ we have 

\A'{h + \)\>N-^-f'^'h-^\A\. 

Thus from the point of view of the structure of the spectrum, we have that A' is essentially 
as good as A. However, the set A' has a big advantage over A in that we have good bounds 
on the Fourier transform of its fibers. This is because the fibers are either empty or close to 
size Empty fibers achieve no increments. On the other hand, fibers which are close to 
average cannot have an increment too large, or else the set A will have a strong increment 
on a translate of a codimension 1 subspace of . Precisely, the estimate we get is 

l^k«(2;)| < ?>-^\A\N-'^+f^'^ = pN^'^+f^'l (8.1) 
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(This is because the negative increment on density to pass from A to A'j^ ^ does no more 
than to reduce the density by a factor of 1 — N^^^^~^ and the codimension of H is only 
/(e).) Should the Fourier transform of A'j^ ^ exceed the above bound, then A will have a 
strong increment into a codimension one subspace of H-^ + v. 

Moreover, the set A'jj y is a large capset without strong increments (on subspaces of codi- 
mension no more than ^—N^^''^) but with e replaced by /(e). Thus we see using Proposition 
that for any subspace L of F^\H with dimension d, we have that 

1^1 \2^AT~l+f{e) 



This estimate is a version of the bound on the number of elements of the spectrum of y 
in W. 

Prom this information together with the fact that no subspace K of dimension d contains 
more than dN^'^'"'^ elements of A and hence of A', we are ready to achieve a contradic- 
tion. 

We introduce some measure spaces on which we will apply Lemma 12.21 Por each u S V, 
we define Xv, a measure on F.^\H — {0} by 

X.{X) = \A'hA^)\'. 

Clearly the total measure of Xv is bounded by 3~^|j4| A^-^(^), by Plancherel. We now give a 
construction for sets which have large density for many of the measures Xv- 

Let E C A' with \E\ = d < N'^^f^^\ Then in hght of the fact that E + H contains at 
least dN^^f^""^ points x where A'{x) is large and in light of Proposition 18. 3| there must 
be > N~^^^^\H\ values of v for which spaniE) has density at least dN~'^~^^''^ for Xv, 
by the large families principle. (Here, by slight abuse of notation, the span is taken in 
Fi^\H.) 

We will take d ~ . Now let H be any collection of A^l+'^(v^) subsets of A' with 
cardinality Na . We chose the exponents | and | somewhat arbitrarily. In particular, they 
allow us to apply Lemma 12. 2[ By pigeonholing, we may find a set of w's of cardinality 
|ff |JV~-^('^) for which at least N3~-f^''^ of the sets H in H have the property that span{E) 
has density N~3~f^^^ for Xv Prom this and Lemma 12.21 we obtain the lower bound 



Xv{span{Ei) n span{E2)) ^ , ^,8 
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What this says is that modulo A^-'^*^'^-* terms, the average density (in Xv measure) of an 
intersection span(Hi) n span{r.2) is at least approximately N~i . This is rather large since 
the density of any single element of F^\H is only approximately A^~^ in light of the bound 
on the Fourier transforms of the fibers, which follows from (jS.ip . 

We will now produce a random construction of 11. We choose each set of 11 uniformly at 
random. We will write down an upper bound on the expected density in any Xv of the 
intersection span(Hi) n span{'r.2) which contradicts the lower bound [8^ We observe that 
if p is the density in Xv of spani^i) n span{^2) then 

p<Ar-3+o(/(^))(3fc_l)^ 

(because Xv{x) < A^~4+0(/(e)^^ where k is the nullity of the set Hi U H2. We recall that 
there are no more than A^3+/('^) elements of A' in any subspace of dimension 2A'"3. Thus 
as we choose the 2A^3 elements of Hi and H2, the probability of introducing nullity at each 
selection is bounded by A^^3+^('^). Thus the probability p^ of having nullity k is bounded 
by 

Pk < {N-l+f(^^)\ 

Thus the expected density of span{Ei) D span{E2) is 0{N~~s"^f^'^'^). This contradicts (|8.2p . 
so we have just proved that A cannot contain a set as thick in A + H as required by 
Theorem 17. H which is a contradiction. 

Thus we have proved Theorem 11.41 and hence Theorem 11.11 

9 Epilogue on efficiency 

The e which we obtain in Theorem 11.11 is of necessity quite small. Like many arguments 
in analysis, ours does not aim for great efficiency and we seem to lose a factor in the size 
of e in nearly every line of the argument. However, there are a few points in the argument 
where we lose significantly more. The most notable of these are the use of the asymmetric 
Balog Szemeredi Gowers lemma and the discovery of the additive structure with comity 
in Corollary 16.61 What these two parts of the argument have in common is that they 
are iterative. One starts with a structure and seeks a certain property. If the property is 
lacking, one finds a structure which is in a certain sense better but which has cost us a 
factor in e. 

Iteration can be a powerful thing. It sometimes allows us to prove a very deep thing 
with very few words. The proof doesn't care how many iterates must be calculated to 
implement it. We have a version of our argument which we will publish elsewhere that 
adds even another layer of iteration to the argument but spares us some of the difficulties 
of Theorem 16. 101 But certainly, it is less efficient than the argument presented here. 
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To improve efficiency, it is important to make the argument less iterative. In hopes of 
doing so, we leave the reader with two conjectures: 

Conjecture 9.1. Let N > be a large parameter. (Constants may not depend on N.) 
Suppose B and C are sets in F-^ each of whose cardinality is bounded by N^^^ . Suppose 
the number of quadruplets bi + ci = 62 + C2 with ^1,62 S B and ci,C2 G C is at least 
\B\\C\'^N^'^ . Then C is contained in a subspace of dimension N'-^^"\ 

Conjecture 9.2. Suppose A is a subset of which is additively non- smoothing. (We 
may assume as in the previous conjecture that |A| < N^^^ . Then A admits an additive 
structure at some height with N^^'^'> comity. 
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